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(N ■ Abstract 

>, 

In this paper we explore the use of Answer Set Programming (ASP) to formalize, and reason about, 
psychological knowledge. In the field of psychology, a considerable amount of knowledge is still 
expressed using only natural language. This lack of a formalization complicates accurate studies, 
comparisons, and verification of theories. We believe that ASP, a knowledge representation formal- 
ism allowing for concise and simple representation of defaults, uncertainty, and evolving domains, 
can be used successfully for the formalization of psychological knowledge. To demonstrate the via- 
bility of ASP for this task, in this paper we develop an ASP-based formalization of the mechanics of 
Short-Term Memory. We also show that our approach can have rather immediate practical uses by 
demonstrating an application of our formalization to the task of predicting a user's interaction with a 
graphical interface. 
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1 Introduction 

In this paper we explore the use of Answer Set Programming (ASP) ( Gelfo nd and Lif schitz 1 99 ft 
Mare kand Truszczynski 1999] ) to formalize psychological knowledge and to reason about 
it. The importance of a precise formalization of scientific knowledge has been known for 
a long time (see e.g. Hilbert's philosophy of physics). Most notably, formalizing a body 
of knowledge in an area improves one's ability to (1) accurately study the properties and 
consequences of sets of statements, (2) compare competing sets of statements, and (3) 
design experiments aimed at confirming or refuting sets of statements. In the field of psy- 
chology, some of the theories about the mechanisms that govern the brain have been for- 
malized using artificial neural networks and similar tools (e.g. ( McCarley et al. 2002| l). 
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That approach works well for theories that can be expressed in quantitative terms. How- 
ever, theories of a more qualitative or logical nature, which by their own nature do not 
provide precise quantitative predictions, are not easy to formalize in this way. We be- 
lieve that ASP can be used successfully for the formalization of such bodies of knowl- 
edge. ASP is a knowledge representation formalism allowing for concise and simple 
representation of defaults, uncertainty, and evolving domains, and has been demon- 
strated to be a useful paradigm for the formalization of knowledge of various kinds (e.g. 
dBaral and Gelfond 20051 ISon and Sakama 20091) ). One further advantage of ASP is that it 
is directly executable, in the sense that the consequences of collections of ASP statements 
can be directly, and often efficiently, computed using computer programs. 

To demonstrate the viability of ASP for the formalization of psychological knowledge, 
in this paper we develop an ASP-based formalization of the mechanics of Short-Term 
Memory (STM) and chunking. We selected this theory because it is rather mature and, like 
the general type of psychological knowledge that we aim to formalize, it is mostly of a 
qualitative nature, and is expressed, in the literature, at a rather high level of abstraction. 
Moreover, formalizing it is challenging, because it involves modeling a sophisticated dy- 
namic domain involving non-determinism, fixed-capacity storage, and time-based decay. 
The combination of these features makes it rather difficult to use other languages for the 
encoding. As a confirmation of the benefits of formalizing psychological knowledge, we 
show that our formalization immediately allows us to point out an element of the model 
of STM that may deserve further investigation. We also show that our approach can have 
rather direct practical uses by demonstrating an application of our formalization to the 
task of predicting a user's interaction with a graphical interface. To the best of our knowl- 
edge, ours is the first attempt to use ASP for the formalization of theories about the brain, 
although it is worth mentioning that ASP has been previously applied previously to the 
specification of a user's mental model for recommender systems (ILeite and Ilic 20071 ). Al- 
though for our formalization we follow a rather well-known theory of STM and chunking 
(e.g. dKassin 200 6 , Nevid 2007)), we do not intend to claim this to be the "correct" theory. 
On the contrary, any objections to the theory that our formalization may raise are a further 
demonstration of the benefits of formalizing psychological knowledge. 

The paper is organized as follows. We start with some background on ASP and on the 
representation of dynamic domains. Next, we provide an account of the mechanics of STM 
and chunking, as it is commonly found in psychology literature. Then, we describe our 
ASP-based formalization of the mechanics of STM and chunking, and show how the for- 
malization can be used for the task of predicting a user's interaction with a graphical in- 
terface. Finally, we conclude with a brief discussion on what we have achieved and on 
possible extensions. 

2 Answer Set Programming and Dynamic Domains 

Let us begin by giving some background on ASP. We define the syntax of the language 
precisely, but only give the informal semantics of the language in order to save space. We 
refer the reader to (Gelf ond and Lifschitz 199ll [Niemela and Simons 20Q0J for a specifi- 
cation of the formal semantics. Let S be a signature containing constant, function and 
predicate symbols. Terms and atoms are formed as usual in first-order logic. A (basic) lit- 
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eral is either an atom a or its strong (also called classical or epistemic) negation ->a. A rule 
is a statement of the form: 

hi V ... V hk «- h, ■ ■ ■ , l m , not l m +i, • • ■ , not l n 

where hi's and li's are ground literals and nof is the so-called default negation. The intu- 
itive meaning of the rule is that a reasoner who believes {1%, . . . , l m } and has no reason 
to believe {l m +i, . . . , l n }, must believe one of hi's. Symbol <— can be omitted if no li's 
are specified. Often, rules of the form h -s— not h, h, . . . , not l n are abbreviated into 
<— h,... ,not l n , and called constraints. The intuitive meaning of a constraint is that 
{li, . . . , l m , not Z rn+ i, . . . , not /„} must not be satisfied. A rule containing variables is 
interpreted as the shorthand for the set of rules obtained by replacing the variables with all 
the possible ground terms. A program is a pair (E, II}, where £ is a signature and II is a set 
of rules over S. We often denote programs just by the second element of the pair, and let 
the signature be defined implicitly. Finally, the answer set (or model) of a program II is the 
collection of its consequences under the answer set semantics. Notice that the semantics of 
ASP is defined in such a way that programs may have multiple answer sets, intuitively cor- 
responding to alternative views of the specification given by the program. In that respect, 
the semantics of default negation provides a simple way of encoding choices. For example, 
the set of rules {p <— not q. q ■(— not p.} intuitively states that either p or q hold, and 
the corresponding program has two answer sets, {p}, {q}. Because a convenient represen- 
tation of alternatives is often important in the formalization of knowledge, the language 
of ASP has been extended with constraint literals (Niemel a and Simo ns 2000), which are 
expressions of the form m{li, I2, ■ ■ ■ , h}n, where to, n are arithmetic expressions and li's 
are basic literals as defined above. A constraint literal is satisfied whenever the number of 
literals that hold from {1%, . . . , l^} is between to and n, inclusive. Using constraint literals, 
the choice between p and q, under some set of conditions T, can be compactly encoded 
by the rule l{p, q}l 4— T. A rule of this form is called choice rule. To further increase 
flexibility, the set {l\, . . . , Ik] can also be specified as {l(X) : d(X)}, where X is a list 
of variables. Such an expression intuitively stands for the set of all l(x) such that d(x) 
holds. We refer the reader to (Nie mela and Si mons 2000]) for a more detailed definition of 
the syntax of constraint literals and of the corresponding extended rules. 

Because of the dynamic nature of STM, for its formalization we use techniques 
from the area of the representation of dynamic (or evolving) domains. The key el- 
ements of the representation techniques are presented next; we refer the readers to 
e.g. dGelfond and Lifschitz 199 8, Gelfo nd"2002l) for more details. Fluents are first-order 
ground terms, and intuitively denote the properties of interest of the domain (whose 
truth value typically depends upon time). For example, an expression of the form 
on(blocki : block,2) is a fluent, and may mean that block\ is on top of block2- A fluent 
literal is either a fluent / or its negation (->/). Actions are also first-order ground terms. 
For example, move{block^, block2) may mean that blocks is moved on top of block2- A 
set of fluent literals is consistent if, for every fluent /, / and ->/ do not both belong to 
the set. A set of fluent literals is complete if, for every fluent /, either / or -1/ belong 
to the set. The set of all the possible evolutions of a dynamic domain is represented by a 
transition diagram, i.e. a directed graph whose nodes - each labeled by a consistent set of 
fluent literals - correspond to the states of the domain in which the properties specified are 
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respectively true or false, and whose arcs - each labeled by a set of actions - correspond 
to the occurrence of state transitions due to the execution of the actions specified. When 
complete knowledge is available about a state, the corresponding set of fluent literals is 
also complete. When instead a set of fluent literals is not complete, that means that the 
knowledge about the corresponding state is incomplete (e.g. it is unknown whether / or 
-1/ holds). Incomplete or partial states are typically used to represent uncertainty about 
domains. 

Because the size of transition diagrams grows exponentially with the increase of the 
number of properties of the domain and actions, a direct representation is usually im- 
practical. Instead, transition diagrams are encoded using an indirect representation, based 
on the research on action languages (Gelf ond andLifschitz 1998b . Because we are not 
aware of any well-established action language that combines all the features needed for 
our model, and because in this paper the focus is not on developing a new action lan- 
guage, here we adopt the variant of writing such encoding directly in ASP - see e.g. 
dBalduccini et al. 20001) . In fact, because of the high level of abstraction of ASP, encoding 
knowledge (even about dynamic domains) directly in ASP rather than in action languages 
is nowadays common practice (see e.g. (Delgrande et al. 2009 Thielscher 2009) for some 



recent examples). 

The encoding is based on the notion of a path in the transition diagram from a given 
initial state, corresponding to a particular possible evolution of the domain from that initial 
state. The steps in a path are identified by integers (with denoting the initial state), and 
logical statements (often called laws) are used to encode, in general terms, the transitions 
from one step to the next. The fact that a fluent / holds at a step i in the evolution of 
the domain is represented by the expression h(f, i), where relation h stands for holds. 
If -1/ is true, we write ^/i(/, i). Occurrences of actions are represented by expressions 
of the form o(a, i), saying that action a occurs at step i (o stands for occurs). An action 
description is a collection of laws describing the evolution of the domain. Given an action 
description AD, a description of the initial state Co (e.g. Co = {h(fi, 0), _| /i(/2, 0), . . .}), 
and a sequence of occurrences of actions a (e.g. a = {o(ai, 0), 0(03, 0), 0(04, 1), . . .}), 
the corresponding path(s) in the transition diagram can be computed by finding the answer 
set(s) of AD U cr U a. 

3 Short-Term Memory 

Short-Term Memory is "the memory storage system that allows for short-term re- 
tention of information before it is either transferred to long-term memory or forgot- 
ten" fNevid 2007). This view is based on the so called three-stage model of memory 
( I Atkinson and Shiffrin 19711) : sensory inputs are first stored in Sensory Memory, which 
is very volatile and has large capacity; then, a portion of the inputs is processed - and 
possibly transformed into more rich representations - and moved to Short-Term Memory, 
which is less volatile than Sensory Memory, but of limited capacity. Short-Term Memory is 
also often viewed as a working memory, i.e. as a location where information is processed 
dCardet al. 19831) . Finally, selected information is moved to Long-Term Memory, which 
has larger capacity and longer retention periods. 

Beginning in the 1950s, several studies have been conducted to determine the capacity of 



Formalization of Psychological Knowledge in ASP and its Application 5 

STM. Miller (Miller 1956) reported evidence showing that the capacity of STM in humans 
is of 7 pieces of information. Later studies have lowered the capacity limit of STM to 
about 4 pieces of information (e.g. dCowan 200 0)). Interestingly, the limit on the number of 
pieces of information that STM can hold does not affect directly the amount of information 
(in an information-theoretic sense) that STM can hold. In fact, STM appears to be capable 
to storing references to concepts that are stored in Long-Term Memory. Although one such 
reference counts as a single piece of information toward the capacity limit of STM, the 
amount of information it conveys can be large. For example, it has been observed that it is 
normally difficult for people to remember the 12-letter sequence CN NIB MMT VU SA, 
while most people have no problems remembering the sequence CNN IBM MTV USA, 
because each triplet refers to a concept stored in Long-Term Memory, and can thus be 
represented in STM by just 4 symbols (Kassin 2006). The phenomenon of the detection 
and use of known patterns in STM is referred to as chunking. 

Another limit of STM is that the information it contains is retained only for a short period 
of time, often set to about 30 seconds by researchers ((Nevid 2007); notice however that the 
issue of a time limit on the information stored in STM is somewhat controversial - see e.g. 
dCowan 2000UCard et al. 1983b )l 1 lThis limit can be extended by performing maintenance 
rehearsal, which consists in consciously repeating over and over the information that needs 
to be preserved. To increase the flexibility of our formalization, in the next section we 
abstract from specific values for the limits of capacity and retention over time, and rather 
write our model in a parametric way. This makes it possible, among other things, to use our 
formalization to analyze the effects of different choices for these parameters, effectively 
allowing us to compare variants of the theory of STM. 

4 A Formalization of Short-Term Memory 

Using a common methodology in ASP-based knowledge representation, we begin our dis- 
cussion on the formalization by condensing the description of STM and chunking in a 
number of statements still written in natural language, but precisely formulated. Later, we 
will encode those statements using ASP. The statements are: (1) STM is a collection of 
symbols; (2) The size of STM is limited to ui elements; (3) Each symbol has an expira- 
tion time associated with it (saying when the piece of information will be "forgotten"); 
(4) Symbols are divided into primitive symbols and chunk symbols; (5) Primitive sym- 
bols represent concepts that are innate in the brain (e.g. are direct encoding of sensory 
input); (6) Chunk symbols represent concepts that are stored in Long-Term Memory; (7) 
New symbols can be added to STM. If a symbol is added to STM when u elements are 
already in STM, the symbol that is closest to expiring is removed from STM ("forgotten"). 
In the case of multiple symbols equally close to expiring, one is selected arbitrarily; (8) 
When a symbol is added to STM, or when a symbol from STM is used (in particular, when 
performing maintenance rehearsal), its expiration time is reset to a constant value e; (9) 
Simplifying assumption: only a single operation (where by operation we mean either ad- 
dition or use) can occur on STM at any given time. The next set of statements describes 

1 For example, according to (Card et al. 1983 ), decay is affected by variables such as the number of chunks that 
the user is trying to remember, and retrieval interference with similar chunks. 
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the mechanism of chunking: (10) A chunk is a set of (primitive or chunk) symbols; (1 1) A 
chunk symbol is a symbol that uniquely denotes a chunk; (12) A chunk is detected in STM 
if all the symbols that it consists of are in STM; (13) When a chunk is detected in STM, 
the symbols it consists of are removed from STM and the corresponding chunk symbol 
is added to STM; (14) A symbol can be inferred from STM if it belongs to STM, or if it 
is part of a chunk that can be inferred from STM;|j (15) Simplifying assumption: chunks 
can only be detected when STM is not in use (i.e. no addition or use operations are being 
performed); (16) Simplifying assumption: at every step, at most one chunk can be detected. 

Now we are ready to focus on the formalization of STM and chunking in ASP. Flu- 
ent instm(s) says that symbol s (where s is a possibly compound term) is in STM; 
expiration(s, k) says that symbol s will expire (i.e. will be "forgotten") in k units of 
time, unless the expiration counter is otherwise altered. Action store(s) says that symbol 
s is stored in STM; use(s) says that s is used (typically, during maintenance rehearsal). 
Relation primitive(s) says that s is a primitive symbol; chunks are described by relation 
chunk(s), saying that s is a chunk symbol, and by relation chunk _element(s , s'), stating 
that s' is a component of the chunk identified by s; relation symbol(s) says that s is a 
symbol (either primitive or chunk). Relation stm-max_size(uj) says that the size of STM 
is limited to u elements; stm_expiration(e) states that the symbols in STM expire after 
e units of time. Finally, in order to update the expiration counters based on the duration of 
the actions executed at each step, relation dur(i, d) says that the overall duration of step i, 
based on the actions that took place, is d units of time. 

We divide the axioms in our formalization of the STMf| in a number of categories. We 
begin with the axioms that describe the effect of storing a symbol s in STM. The first axiom 
says that an effect of storing s in STM is that s becomes part of STM: 

h(instm(S),I + 1) <- o(store(S),I). 

The next axiom says that, if adding a symbol to STM would cause the STM size limit 
to be exceeded, then the symbol that is closest to expiring will be forgotten. Notice that 
the simplifying assumptions listed among the natural language statements above guarantee 
that there is a unique such symbol, thus simplifying the writing of the axiom (lifting the 
assumptions is easy, but would lengthen the presentation). 

-ih(instm(S2),I + l) <- 

51 / 52, o(store(Sl),I), stm_max_size(MX), 
curr_stm_size(MX,I), not some-expiring(I), oldest Jnstm(S2, 1). 

The axiom relies on a number of auxiliary relations. These relations depend on the current 
step in the evolution of the domain, and hence could be represented as fluents. However, to 
stress the distinction between "regular" fluents and auxiliary relations, we prefer the alter- 
native writing that does not rely upon relation h, thus writing e.g. currstmsize(MX, I) 
instead of h(curr_stm_size(MX), I). Relation currstmsize(w, i) says that the size of 
STM at step i is w; some-expiring(i) states that some symbol in STM will be forgotten 

2 Notice that this statement can be applied recursively. 
To save space, we omit most atoms formed by domain predicates and, in a few rules, use default negation 
directly instead of writing a separate rule for closed- world assumption. For example, if p holds whenever q is 
false and q is assumed to be false unless it is known to be true, we might write p <— not q instead of the more 
methodologically correct {p <— -iq. -iq <— not q.}. 
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when the domain moves to the next step; oldest Jnstm(s,i) says that s is the symbol 
closest to expiring. The accurate reading of the above axiom is then: if s\ is stored in STM 
at step i, and STM already contains u> symbols, none of which are due to expire at the 
next time step, then the oldest symbol in STM will no longer be in STM at step i + 1. The 
auxiliary relations are defined as follows: 



oldest jn_stm(S, I) <— 

h(expiration(S, E),I), 

not smaller _expiration(E, I). 

smaller _expiration(El,I) <— 

h(expiration(S, E2), I), 
E2 < El. 

curr_stmsize(N, I) <— 

N { h(in.stm(S),I) : symbol(S) } N. 

expiring(S,I) <^- 

h(expiration(S, SK), I), 
dur(I,D),SK < D. 

some.expiring(I) <— 

expiring(S,I). 



The next axiom states that an effect of storing s in STM is that the expiration time of s is 
set to e: 

h(expiration(S,E),I + 1) -«— stm_expiration(E) , o(store(S) , I) . 



The next axiom (together with some auxiliary definitions) says that it is impossible for two 
STM-related actions to be executed at the same time. 



<- o(Al,I),o(A2,I), Al -£ A2, stm.related(Al), stmjrelated(A2). 
stmjrelated(store(S)) -s— symbol(S) . 
stmjrelated(use(S)) <— symbol(S) . 

The final axiom in this category states that, whenever a symbol from STM is used, its 
expiration time is reset. 

h(expiration(S, E), I + 1) <— 
stm_expiration(E) , 
o(use(S),I), 
h(instm(S) , I) . 

The second group of axioms describes the mechanism of chunking. Three axioms state 
that, when the components of a chunk are detected in STM, the corresponding symbols are 
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replaced by the chunk symboQ whose expiration is set to e. 

->h{instm{8),I + 1) <- 
detect ed(C, I), 

chunk _element(C ', S). 

h(instm(C),I+ 1) <- 
detected(C, I). 

h(expiration(C, E), I + 1) <— 
stm_expiration(E) , 
detected{C , I) . 

Auxiliary relation detected(C, I), used above, says that chunk C has been detected in STM 
at step I. The detection occurs, when STM is not in use as per our simplifying assumption 
above, by checking if there is any chunk whose components are all in STM. If symbols 
corresponding to multiple chunks are available in STM, only one chunk is detected at 
every step, as per assumption (16) on page [6] The choice of which chunk is detected is 
non-deterministic, and encoded using a choice rulqj, as follows: 

1 { detected(C ', I) : -^chunk_elementjmissing(C, I) } 1 <— 
stmj,dle(l) , 
chunk _detectable(I). 

chunk.detectable(I) <— 
step(I), 
chunk(C), 
stmjidle(I) , 
-ichunk_elementjmissing(C , I). 

^chunk-dement .missing (C ', I) <— 
chunk(C), 
not chunk. element jmis sing (C, 7). 

chunk_element-.missing(C,I) <s— 
chunk _element(C, S), 
-^h(instm(S),I). 

-istmAdle(I) «— 
o(A,I), 
memory _related( A). 

stmJdle(I) <— 

not —istmJ,dle{I). 

It is interesting to note that the components of a detected chunk are allowed to be located 
anywhere in STM. However, now that the model is formalized at this level of detail, one 
cannot help but wonder whether in reality the focus of the mechanism of chunking is on 
symbols that have been added more recently. We were unable to find published studies 
regarding this issue. 

The final group of axioms deals with the evolution of the contents of STM over time, 

4 Our simplifying assumption that at most one chunk can be detected at every step ensures that the number of 
items in STM does not increase as a result of the chunking process. 

5 Readers who are familiar with ASP may notice that we allow the use of non-domain predicate 
-^chunk-element jmissing(C, I) in the head of the choice rule. This is done to keep the presentation short. 
From the perspective of the implementation, when using ASP parsers that expect a domain predicate, one 
would have to use a slightly longer encoding. 



Formalization of Psychological Knowledge in ASP and its Application 9 

both when store or use actions occur, and when they do not. Notice that action theories 
often assume that fluents maintain their truth value by inertia unless they are forced to 
change by the occurrence of actions. In the case of fluents instm(s) and expiration(s, k), 
however, the evolution over time is more complex (and such fluents are then called non- 
inertial). In fact, for every symbol s in STM, expiration(s,e) holds at first, but then 
expiration(s,e) becomes false and expiration(s,e — 6) becomes true, where 8 is the 
duration of the latest step, and so on. On the other hand, instm(s) is true if-and-only-if 
expiration(s, k) holds for some k > 0. The following axioms accomplish three tasks: 
they define the behavior of inertial fluents, using a rather standard ASP encoding of the 
inertia axiom, which relies on ASP's ability to encode defaults (the rule for ~^h(F, I + 1), 
omitted, is similar); they state that fluents in_stm(s) and expiration(s, k) are non-inertial 
(the reason for doing so explicitly will become clear in the next section); and formalize the 
default evolution of the non-inertial fluents' truth value over time. Notice that relation 
expiring, used here, was defined above (pageQ. 

h(F, I + 1) «- 

h(F,I), 

not -<h(F,I + l), 

not noninertial(F) . 

noninertial(in_stm(S)) <— 
symbol(S) . 

noninertial(expiration(S, E)) •<— 
symbol(S) , 
expiration.value(E). 

h(in.stm(S) , I + 1) «- 

h{in_stm{S),I), 

not expiring(S,I), 

not -ih{injatm{S) , I + 1) . 

-ih(in_stm(S) , I + 1) «- 
h(in.stm(S),I), 
expiring(S,I), 
not h(in_strn(S),I + 1). 

-ih(instm(S) , I + 1) «- 

-*h(in-stm(S),I), 

not h(in_stm(S) , I + 1) . 

h(expiration(S, E — D),I + 1) <— 
expiration jvalue(E) , 
h(expiration(S, E),I), 
dur(I,D),E > D, 

not different_expiration(S, E — D, I + 1), 
not ->h(in_stm(S) , I + 1). 

different_expiration(S, El, I) <— 
expiration.value(El), 
expirationjvalue(E2), 
E2^E1, 
h(expiration(S, E2),I). 

To demonstrate that our formalization captures the key features of the mechanics of STM 
and chunking, we subject it to a test of memory span. In a memory-span test, a subject is 
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presented with a sequence of digits, and is asked to reproduce the sequence (lKassin 2006). 
By increasing the length of the sequence and by allowing or avoiding the occurrence of 
familiar sub-sequences of digits, one can verify the capacity limit of STM and the role 
of chunking. Because here we are only concerned with correctly modeling STM, we ab- 
stract from the way digits are actually read, and rather represent the acquisition of the 
sequence of digits directly as the occurrence of suitable store(s) actions. Similarly, the 
final reproduction of the sequence is replaced by checking the contents of STM at the end 
of the experiment. As common in ASP, all computations are reduced to finding answer sets 
of suitable programs, and the results of the experiments are determined by observing the 
values of the relevant fluents in such answer sets. 

From now on, we refer to the above formalization of STM by Hstm- Boundary condi- 
tions that are shared by all the instances of the memory-span test are encoded by the set 
lip of rules, shown below. The first two rules of lip set the value of cj to a capacity of 
4 symbols (in line with (Cow an~20001 l) and the value of e to 30 time units. The next rule 
states that each step has a duration of 1 time unit. This set-up intuitively corresponds to 
a scenario in which STM has a 30 second time limit on the retention of information and 
the digits are presented at a rate of one per second. The last three rules define the set of 
primitives for the memory-span test. We use the expression seq(p, d) to represent the fact 
that digit d is at position p in the sequence. 

stmjmaxsize(4) . 
stm_expiration(30) . 

dur(I,l). 

position(l) . position(2) . ... position(6). 
digit(0). digit(l). . . . digit(9). 
primitive(seq(P, D)) <f- position(P), digit(D). 

The initial state of STM is such that no symbols are initially in STM. This is encoded by 

&STM'- 

-^h(in_stm(S) , 0) <— symbol(S). 

In the first instance, STM is presented with the sequence 2, 4, 5, 7. Human subjects are 
normally able to reproduce this sequence. Let us see if our formalization can do the same. 
The sequence of digits is encoded by set SPANi of rules: 

o(store(seq(l, 2)), 0). o(store(seq(2, 4)), 1). 
o(store(seq(3, 5)), 2). o(store(seq(4, 7)), 3). 

To predict the behavior of STM and determine which symbols will be in it at the end of 
the experiment, we need to look at the path in the transition diagram from the initial state, 
described by cfstm, an d under the occurrence of the actions in SPANi, As explained 
earlier in this paper, this can be accomplished by finding the answer set of III = Hstm U 
lip U a stm U SPANi, It is not difficult to check that, at step 4 (corresponding to the end 
of the experiment), the state of STM is: 

h(in_stm(seq(l, 2)), 4), h(instm(seq(2, 4)) , 4) , 
h(instm(seq(3, 5)), 4), h(instm(seq(4, 7)), 4), 
h(expiration(seq(l, 2), 27), 4), h(expiration(seq(2, 4), 28), 4), 
h(expiration(seq(3, 5), 29), 4), h(expiration(seq(4:, 7), 30), 4), 

which shows that the sequence is remembered correctly (and far from being forgotten, as 
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the expiration times show). Let us now consider another instance, in which the sequence 
of digits is 2, 4, 5, 7, 3. The corresponding store(s) actions are encoded by SPAN2 — 
SPANi U {o(store(seq(5, 3)), 4)}. This sequence is beyond the capacity of STM stated 
in lip. Human subjects are unable to reproduce sequences beyond the capacity of STM 
(unless chunking occurs). Our formalization of STM exhibits the same behavior. In fact, 
according to the answer set of program n 2 = Hstm U lip U <jstm U SPAN 2 the state 
of STM at the end of the experiment is: 

h(instm(seq(2, 4)) , 5), h(instm(seq(S, 5)), 5), 
h(in_stm(seq(4:, 7)), 5), h(instrn(seq(5, 3)), 5). 

As expected, the first element of the sequence has been forgotten. In the next instance, we 
consider the sequence 5,8,5 — 8, 0, 6, where "— " represents a 1-second pause in the pre- 
sentation of the digits. This sequence is, in principle, beyond the capacity of STM, but we 
further assume familiarity with the area codes 585 and 806. Under these conditions, human 
subjects have demonstrated to be capable of remembering the sequence after noticing the 
presence of the area codes in them|3 The knowledge about the area codes is encoded by 
the set Ti of rules: 

chunk(seq(P, ac(roc))). 
chunk_element(seq(P, ac(roc)), seq(P, 5)). 
chunk-element(seq(P, oc(roc)), seq(P + 1, 8)). 
chunk_element(seq(P, ac(roc)), seq(P + 2, 5)). 

chunk(seq(P, ac(lbb))). 
chunk_element(seq(P, ac(lbb)) , seq(P, 8)). 
chunk_element(seq(P, ac(lbb)), seq(P + 1, 0)). 
chunk_element(seq(P, ac(lbb)), seq(P + 2, 6)). 

The sequence of digits is encoded by SPAN 3 : 

o(store(seq(l, 5)), 0). o(store(seq(2, 8)), 1). 
o(store(seq(3, 5)), 2). % no action at step 3 
o(store(seq(4:, 8)), 4). o(store(seq(5, 0)), 5). 
o(store(seq(6, 6)), 6). 

At the end of the experiment (we select step 8 to allow sufficient time for the chunking of 
the second triplet to occur), the state of STM predicted by our formalization is: 

h(in_stm(seq(l, ac(roc))), 8), 
h(in_stm(seq(4:, ac(lbb))), 8), 
h(expiration(seq(l, ac(roc)), 26), 8), 
h(expiration(seq(4:, ac(lbb) ) , 30) , 8) 

which shows that the chunking of the two area codes has occurred, allowing STM to store 
a sequence of digits that is longer than u symbols. In the final instance of this section, we 
consider two additional chunks, 5,8,5,2 (supposedly a pin number), and 1, 3 (considered 
an unlucky number in various cultures), which are encoded with the same technique shown 
above. The corresponding set of rules T[ is obtained from I\ by adding the encoding 



6 The 1-second pause is used to allow sufficient time for the detection of the first chunk. Subject studies have 
shown that chunk detection does not occur or occurs with difficulty when the stimuli are presented at too high 
a frequency. 
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of the new chunks, called myjpin and unlucky _13. The sequence to be remembered is 
5, 8, 5 - 2, 1, 3, encoded by SPAN 4 : 

o(store(seq(l, 5)), 0). o(store(seq(2, 8)), 1). 
o(store(seq(3, 5)), 2). % no action at step 3 
o(store(seq(4, 2)), 4). o(store(seq(5, 1)), 5). 
o(store(seq(6, 3)), 6). 

Notice that, at step 7, chunks myjpin and unlucky A3 will both be available for detection. 
The literature does not specify any particular order in which the chunks are detected by the 
brain, and thus one should assume that the order of detection is arbitrary|3 It is not diffi- 
cult to show that our formalization correctly yields two answer sets, both predicting a final 
state of STM in which myjpin and unlucky AS are in STM, but differing for the order 
which chunking occurs. One answer set, in fact, predicts the chunking of myjpin first: 
{detected(seq(l, myjpin), 7), detected(seq(5, unlucky A3), 8)}, while the other pre- 
dicts the chunking of myjpin last: {detected(seq(5, unlucky _13), 7), detected(seq(l, 
myjpin), 8)}. Although space restrictions prevent us from formalizing alternative theories 
of STM and perform an analytical comparison, it is worth noting that even the single for- 
malization developed allows comparing (similar) variants of the theory corresponding to 
different values of u) and e. One could for example repeat the above experiments with dif- 
ferent parameter values and compare the predicted behavior with actual subject behavior, 
thus confirming or refuting some of those variants. 



5 A Practical Application 

The availability of a formalization of psychological knowledge not only allows better anal- 
ysis, comparison, and verification of psychological theories, but may also have more imme- 
diate practical applications. In this section we show how our formalization can be used to 
predict a user's difficulties in interacting with a graphical user interface. Psychological the- 
ories used in human-computer interaction (e.g. dCard et aT7 l983; Kie ras and Poison 19831 
IKieras and Polson"T9 85 )) for this kind of evaluations are often of a qualitative nature and 
do not allow for precise quantitative predictions. The ability to accurately encode this type 
of knowledge in ASP allows one to use psychological theories to draw accurate conclu- 
sions, and to do so automatically. We believe this to be a clear step forward from the use of 
rules of thumb and guidelines common nowadays in human-computer interaction. We con- 
sider a scenario in which the user is told a sequence of tasks (menu clicks, tab selections, 
etc.) to be performed, and is expected to execute it without being reminded about any task. 
If the user succeeds, then that means that the sequence can be stored completely in STM 
(possibly chunked), and that the sequence is also short enough not to be forgotten during 
execution. This scenario corresponds for example to a situation in which a user is trying 
to follow the instructions on a help page. To begin, we need to formalize the sequence 
of tasks and its execution. Basic operations are clickim), corresponding to clicking menu 
item to; select(t), meaning that tab t in a dialog box is selected; check(c), correspond- 
ing to putting a checkmark in checkbox c. The primitive STM symbols are of the form 



Simplifying assumption (16) prohibits concurrent detection. 
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task(n, op), where op is one of the basic operations, and n is its index in the sequence. 
The sequence is stored in STM by means of occurrences of store(s) actions, as before. 
We use term task(n, op) also as one of the fluents of the formalization. Its meaning is that, 
based on the state of the reasoner's memory, op is the n th task in the sequence. The fluent 
is non-inertial, because it is defined directly by the current state of STM: 

noninertial(task(N, A)). 

h{task{N,A),I) «- stmJi(task(N,A),I). 

stmJi(S,I) <— h(instm(S),I). 

stm_h(S,I) <— stmJi(C,I),chunk_element(C,S). 

From the point of view of knowledge representation, it is worth noting how ASP makes 
it easy to express the recursive definition of relation stmJi(s, i). The other key fluents of 
the formalization of the sequence are current _task(n), which says that the current task is 
the n th in the sequence, task-forgotten(n), meaning that the n task has been forgotten, 
and completed, meaning that the sequence has been completed. Task number 1 is selected 
as current task as soon as it becomes available (the use of defaults here and later greatly 
shortens the representation): 

h(current_task(l) , I) <— hasJ,ask{l,I), not ->h(current-ta$k{l) , I) . 

Relation hasJask(n, i) (definition omitted to save space) says that, at step i, the sequence 
has a task with index n. Performing the current task makes the following task become 
current, unless the task that was performed was the last in the sequence, in which case the 
sequence is complete: 

h(current-task(N + 1), / + 1) «— 

o(A, I),h(currentJ,ask(N), I), 
h(task(N, A), 7), not lastJask(N, I). 

h(completed, I + 1) •<— 

o(A, T), h{currentJask(N),I), 
h(task(N, A), I), lastJask(N, I). 

(Other rules that ensure that only one task is current at any time are omitted.) The reasoner 
detects that the current task has been forgotten when it cannot recall it: 

h(task_forgotten(N), I) <— h(current-task(N),I), not hasJ,ask(N,I). 

Task execution is encoded by a rule saying that, if the current task is executable, then the 
reasoner will perform it: 

o(A, T) «- h(current-task(N),I), h(task(N, A), I), 
current Jbask..executable(I) . 

current-task-executable(I) 4— 

h(current_task(N),I),h(task(N,A),I), not -*o(A,I). 

Finally, we make the duration of the user's actions depend on his skill level (the rules are 
straightforward and omitted to save space). For expert users, every action takes 1 unit of 
time. For beginner users, who need to scan the screen to look for the items to act upon, 
clicking a menu takes 3 units; clicking a sub-menu takes 5; selecting a tab in a dialog, as 
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well as putting a checkmark in a checkbox, takes 7J3 We denote the above set of rules by 
Htask- To test our formalization, we consider an example in which a beginner user is 
given the following sequence, UI\ . (STM-related parameters are u) = 4 and e = 30 as 
before.) 

o(store(task(l, click(m(tools)))) , 0). 
o(store(task(2, click(subm(options)))) , 1). 
o(store(task(3,select(tab(text)))),3). 
o{store(task(A,check(ck{useJiardtabs)))),A). 
o(store(task(5,select(tab(highlighting)))),6). 
o(store(task(6, check(ck(detect .language)))) , 7). 
—o(A,I) <— external SLCtion( A), I < 8. 

Notice the pauses at steps 2, 5, 8, to allow enough time for chunking to occur. The last rule 
ensures that the reasoner waits to have acquired the complete sequence before beginning 
to execute it. The user is assumed to be familiar with the combinations "click tools, click 
options," and "select tab 'text', put a checkmark on 'use hard-tabs'." The encoding, Tg, of 
the first chunk is: 

chunk(task(N , tools. options)). 

chunk _element(task(N , tools .options), task(N, click(rn(tools)))). 

chunk _element(task(N ,tools .options) , task(N + 1, click(subm(options)))) . 

The second chunk is encoded similarly. The encoding of the two chunks is denoted by 
IV Because of our assumption that beginner users take some time to locate the graphical 
items, one can expect that the user will forget the sequence before completing it. The use 
of our formalization allows one to make the prediction more precise. In fact, the answer 
set of the program U^ 1 = Hstm U Htask U ctstm U UI\ U 1^ shows that: (1) chunking 
occurs and the sequence fits in STM, as shown in the answer set by atoms such as: 

h(instm(task(l,tools_options)), 12), 
h(in_stm(task(3,text_htabs)) , 12), 
h(in_stm(task(5, select(tab(highlighting)))) , 12), 
h(in_stm(task(6, check(ck(detectjanguage)))), 12). 

However, the answer set also shows that (2) the last item of the sequence is forgotten right 
before it can be executed: 

h(current_task(5) , 13), 
o(select(tab(highlighting) ) , 13) , 
h(currentjtask{6), 14), 
h(task_forgotten(6), 14). 

In another example, we consider an expert user familiar with the "click tools, click op- 
tions," sequence, but not with the "select tab 'text', put a checkmark on 'use hard-tabs'" 
sequence. This knowledge is encoded by T%, shown earlier. The prediction of the user's 
behavior is given by program U^ 1 = Hstm U Utask U <jstm U UIi U rf, and it is not 



' These figures are meant to be reasonably realistic, but they are not the result of an accurate study. Obviously, 
the conclusions of our work hold independently of the particular numbers chosen. 
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difficult to check that the sequence does not fit in STM: 

h{current-task{V) , 8), 

h(in_stm(task(3, select(tab(text)))) ,8) , 
h(instm(task(4, check(ck(use.hardtabs)))) ,8) , 
h(in_stm(task(5, select(tab(highlighting)))) , 8) , 
h(in_stm(task(6, check(ck(detectjanguage)))),8), 
h(task_forgotten(l),8). 

In the final example, an expert user is familiar with both sequences of tasks. The corre- 
sponding program is II^ 7 = Ustm U Utask U o~stm U UI\ U 1^. The answer set of 
Ilg 7 shows that the user is predicted to apply chunking, remember the sequence through 
the end, and complete its execution: 

h(completed, 15), 

h(instm(task(l, tools .options)) , 15), 
h(in_stm(task(3,text_htabs)) , 15), 
h(in_stm(task(5, select(tab(highlighting)))) , 15), 
h(in_stm(task(6, check(ck(detectjanguage)))), 15). 

It is worth stressing that the ASP programs described above are not only directly exe- 
cutable, but the corresponding computation by state-of-the-art ASP systems is quite fast. 
The answer sets for the examples in this section were each computed in less than a second 
by CLASP on a computer with i7 CPU, 2.93GHz, 8GB RAM. Scaling is also good: repeat- 
ing the experiments for values of to and e, respectively, 7 and 100 increased computation 
time to less than 3 seconds. 



6 Discussion 

In this paper we have shown that it is indeed possible and important to formalize psy- 
chological knowledge that is of qualitative or logical nature, and that ASP is suitable for 
the task. The formalization allows analysis and comparison of theories. It also allows one 
to predict the outcome of experiments, thus making it possible to design better experi- 
ments. Various reasons make the formalization of knowledge of this kind challenging. As 
we hope to have demonstrated, ASP allows one to tackle the challenge, thanks to its abil- 
ity to deal with common-sense, defaults, uncertainty, non-deterministic choice, recursive 
definitions, and evolving domains. To highlight the benefits of the formalization of psy- 
chological knowledge, it is worth stressing how, earlier in the paper, the availability of the 
formalization of STM allowed us to point out that the role of more recent symbols in the 
mechanics of chunking may deserve further investigation. 

We believe it is difficult to find other languages that allow writing a formalization at the 
level of abstraction of the one shown here, and that are at the same time directly executable. 
As a remarkable example, consider how naturally and elegantly inertial and non-inertial 
fluents coexist in our model, and how the introduction of non-inertial fluents occurs in a 
fully incremental fashion: for most of Section |4] the model uses only inertial fluents, and 
one could safely adopt the standard writing of the axioms of inertiajj When the need to 
deal with non-inertial axioms arises later in the section, all that needs to be done is to add 

9 In Sectionf4] we showed directly the more advanced version of the inertia axiom(s) to save space. 
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the condition "not noninertial(F)" to the inertia axioms, and to write suitable definitions 
of the new fluents. It has been extensively demonstrated in the literature on ASP that the 
elaboration tolerance shown in this case is not a coincidence, but a key property of the 
ASP paradigm. Furthermore, direct executability of ASP provides a unique opportunity to 
bridge the gap between formulation of psychological theories and practical applications, 
as we have shown in the previous section. 

Finally, in this paper we have discussed the interaction with a simple graphical interface, 
and have barely touched upon related topics such as user expertise. However, we expect that 
similar techniques to the ones shown here can be used to accurately model the role of exper- 
tise and attention (e.g. in particular in critical tasks (McCarley et al. 2002)). We believe that 



other important theories about the mechanics of the brain, e.g. Long-Term Memory, can 
be formalized following the approach we presented. The formalization of STM itself could 
also be made richer, for example by using additive fluents dLee and Lifschitz"20 0l) to al- 
low concurrent STM updates, by modeling continuous decay (Chintabat hina et al. 2 005), 
and by using intentions (Bar al and Gelfond 20051 ) to represent task sequences. 
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